Authorship attribution of source code by using back propagation neural network based on particle swarm optimization
نویسندگان
چکیده
Authorship attribution is to identify the most likely author of a given sample among a set of candidate known authors. It can be not only applied to discover the original author of plain text, such as novels, blogs, emails, posts etc., but also used to identify source code programmers. Authorship attribution of source code is required in diverse applications, ranging from malicious code tracking to solving authorship dispute or software plagiarism detection. This paper aims to propose a new method to identify the programmer of Java source code samples with a higher accuracy. To this end, it first introduces back propagation (BP) neural network based on particle swarm optimization (PSO) into authorship attribution of source code. It begins by computing a set of defined feature metrics, including lexical and layout metrics, structure and syntax metrics, totally 19 dimensions. Then these metrics are input to neural network for supervised learning, the weights of which are output by PSO and BP hybrid algorithm. The effectiveness of the proposed method is evaluated on a collected dataset with 3,022 Java files belong to 40 authors. Experiment results show that the proposed method achieves 91.060% accuracy. And a comparison with previous work on authorship attribution of source code for Java language illustrates that this proposed method outperforms others overall, also with an acceptable overhead.
منابع مشابه
Experimental and finite-element free vibration analysis and artificial neural network based on multi-crack diagnosis of non-uniform cross-section beam
Crack identification is a very important issue in mechanical systems, because it is a damage that if develops may cause catastrophic failure. In the first part of this research, modal analysis of a multi-cracked variable cross-section beam is done using finite element method. Then, the obtained results are validated usingthe results of experimental modal analysis tests. In the next part, a nove...
متن کاملTraffic Signal Prediction Using Elman Neural Network and Particle Swarm Optimization
Prediction of traffic is very crucial for its management. Because of human involvement in the generation of this phenomenon, traffic signal is normally accompanied by noise and high levels of non-stationarity. Therefore, traffic signal prediction as one of the important subjects of study has attracted researchers’ interests. In this study, a combinatorial approach is proposed for traffic signal...
متن کاملOptimization of ICDs' Port Sizes in Smart Wells Using Particle Swarm Optimization (PSO) Algorithm through Neural Network Modeling
Oil production optimization is one of the main targets of reservoir management. Smart well technology gives the ability of real time oil production optimization. Although this technology has many advantages; optimum adjustment or sizing of corresponding valves is still an issue to be solved. In this research, optimum port sizing of inflow control devices (ICDs) which are passive control valves ...
متن کاملOptimizing the Prediction Model of Stock Price in Pharmaceutical Companies Using Multiple Objective Particle Swarm Optimization Algorithm (MOPSO)
The purpose of this study is to optimize the stock price forecasting model with meta-innovation method in pharmaceutical companies.In this research, stock portfolio optimization has been done in two separate phases.The first phase is related to forecasting stock futures based on past stock information, which is forecasting the stock price using artificial neural network.The neural network used ...
متن کاملComparative Analysis of Neural Network Training Methods in Real-time Radiotherapy
Background: The motions of body and tumor in some regions such as chest during radiotherapy treatments are one of the major concerns protecting normal tissues against high doses. By using real-time radiotherapy technique, it is possible to increase the accuracy of delivered dose to the tumor region by means of tracing markers on the body of patients.Objective: This study evaluates the accuracy ...
متن کامل